SimMatrix: SIMulator for MAny-Task computing execution fabRIc at eXascale
نویسندگان
چکیده
Exascale computing have challenges, most of which can be potentially addressed by Many-task computing paradigm through efficient task execution frameworks that are several orders of magnitude beyond current batch schedulers. This paper proposes a light-weight discrete event simulator, SimMatrix, which simulates distributed job scheduler comprising of millions of nodes and billions of cores/tasks. We validated SimMatrix against MATRIX up to 4K-cores, running on an IBM Blue Gene/P system, and compared SimMatrix with SimGrid and GridSim in terms of resource consumption at scale. Results show that SimMatrix consumes up to two-orders of magnitude lower memory per task, and at least one-order of magnitude (and up to four-orders of magnitude) lower time per task overheads.
منابع مشابه
SimMatrix: SIMulator for MAny-Task computing execution fabRIc at eXascales
Exascale computers will enable the unraveling of significant scientific mysteries. Predictions are that by 2019, supercomputers will reach exascales with millions of nodes and billions of threads of execution. Many-task computing (MTC) is a new viable distributed paradigm for extreme-scale supercomputing. The MTC paradigm can address four of the five major challenges of exascale computing, name...
متن کاملPaving the Road to Exascale with Many-Task Computing
Exascale systems will bring significant challenges. This work attempts to addresses them through the Many-Task Computing (MTC) paradigm, by delivering data-aware job scheduling systems and fully asynchronous distributed architectures. MTC applications are structured as DAG graphs of tasks, with dependencies forming the edges. The asynchronous nature of MTC makes it more resilient than tradition...
متن کاملMATRIX: MAny-Task computing execution fabRIc at eXascale
Efficiently scheduling large number of jobs over large-scale distributed systems is critical in achieving high system utilization and throughput. Today’s state-of-the-art job management systems have predominantly Master/Slaves architectures, which have inherent limitations, such as scalability issues at extreme scales (e.g. petascales and beyond) and single point of failure. In designing the ne...
متن کاملAn Effective Task Scheduling Framework for Cloud Computing using NSGA-II
Cloud computing is a model for convenient on-demand user’s access to changeable and configurable computing resources such as networks, servers, storage, applications, and services with minimal management of resources and service provider interaction. Task scheduling is regarded as a fundamental issue in cloud computing which aims at distributing the load on the different resources of a distribu...
متن کاملCentralized and Distributed Job Scheduling System Simulation at Exascale
Job scheduling systems aim to efficiently manage the distributed computing power of workstations, servers and supercomputers to maximize throughput. As the development of high-speed networking and powerful supercomputers, the scheduling systems should be up to the exascale that is billions of jobs executed by millions of nodes with thousands of cores each, which is however, beyond the state-of-...
متن کامل